Static Fault-Tolerant Real-Time Scheduling with "Pseudo-topological" Orders
Authors
Abstract
We give a graph-theoretical model for off-line fault-tolerant scheduling of dataflow algorithms onto multiprocessor architectures with distributed memory. Our framework allows the modeling of both processor and communication channel failures of the “fail silent” type (either transient or permanent), and failure masking is achieved by replicating operations and data communications. We show that, in general, the graph representing a fault-tolerant scheduling may have circuits; hence, the classical computation of starting and ending times of the operations and communications, based upon a topological order, is inapplicable. We thus provide a notion of “pseudo-topological order” that permits the computation of the starting and ending times even in the case of cyclic graphs. We also derive algorithms for computing the timeouts that are used for failure detection.
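As a point of reference for the classical computation mentioned in the abstract, the following minimal sketch derives earliest starting and ending times from a topological order and fails when the graph contains a circuit. It is not the paper's pseudo-topological order. The function name `schedule_times`, the `durations` and `edges` inputs, and the zero-cost communications are all assumptions made for illustration.

```python
from collections import deque


def schedule_times(durations, edges):
    """Earliest start/end times on an acyclic graph (Kahn's algorithm).

    durations: dict mapping each operation to its duration (hypothetical input).
    edges: list of (u, v) pairs meaning v cannot start before u ends.
    """
    succs = {n: [] for n in durations}
    indeg = {n: 0 for n in durations}
    for u, v in edges:
        succs[u].append(v)
        indeg[v] += 1

    start = {n: 0 for n in durations}
    end = {}
    ready = deque(n for n in durations if indeg[n] == 0)
    while ready:
        u = ready.popleft()
        end[u] = start[u] + durations[u]
        for v in succs[u]:
            # v starts once its latest predecessor has finished.
            start[v] = max(start[v], end[u])
            indeg[v] -= 1
            if indeg[v] == 0:
                ready.append(v)

    # Nodes on a circuit never reach in-degree zero, so they are never timed.
    if len(end) != len(durations):
        raise ValueError("graph has a circuit: no topological order exists")
    return start, end
```

Calling `schedule_times({"a": 2, "b": 3, "c": 1}, [("a", "b"), ("a", "c"), ("b", "c")])` times every operation, whereas adding an edge from `c` back to `a` raises `ValueError`, which illustrates why a cyclic fault-tolerant schedule graph needs a different ordering notion.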
Similar Resources
Reliability-Driven Fault Tolerant Scheduling Heuristics for Distributed Embedded Real-Time Systems
We present a new scheduling approach that automatically produces a fault-tolerant distributed schedule for critical distributed real-time embedded systems. The approach that we propose takes as input a set of operations (tasks), a target distributed architecture, some distribution constraints, some indications on the execution times of the operations on the processors of the target architecture...
An Efficient Backup-Overloading for Fault-Tolerant Scheduling of Real-Time Tasks
Many time-critical applications require dynamic scheduling with predictable performance. Tasks corresponding to these applications have deadlines to be met despite the presence of faults. In this paper, we propose a technique called dynamic grouping, to be used with backup overloading in a primary-backup based fault-tolerant dynamic scheduling algorithm in multiprocessor real-time systems. In d...
Scheduling and Optimization of Fault-Tolerant Distributed Embedded Systems
Safety-critical applications have to function correctly even in presence of faults. This thesis deals with techniques for tolerating effects of transient and intermittent faults. Reexecution, software replication, and rollback recovery with checkpointing are used to provide the required level of fault tolerance. These techniques are considered in the context of distributed real-time systems wit...
DHARMA: A Tool for Evaluating Dynamic Scheduling
A majority of today's real-time systems assume a priori knowledge of task characteristics and hence are based on static designs, which contribute to their high cost and inflexibility. The next generation of hard real-time systems must be designed to be dynamic and flexible. This provides the motivation to study various dynamic scheduling proposals. In this paper, we propose new algorithms for scheduli...
Fault-Tolerant Static Scheduling for Real-Time Distributed Embedded Systems
We present in this paper a heuristic for automatically producing a distributed fault-tolerant schedule of a given data-flow algorithm onto a given distributed architecture. The faults considered are processor failures with fail-silent behavior. Fault tolerance is achieved with the software redundancy of computations and the time redundancy of data dependencies.